Introduction to Data Lakes
Back to Home
01. Introduction
02. Lesson Overview
03. Why Data Lakes: Evolution of the Data Warehouse
04. Why Data Lakes: Unstructured & Big Data
05. Why Data Lakes: New Roles & Advanced Analytics
06. Big Data Effects: Low Costs, ETL Offloading
07. Big Data Effects: Schema-on-Read
08. Big Data Effects: (Un-/Semi-)Structured support
09. Demo: Schema On Read Pt 1
10. Demo: Schema On Read Pt 2
11. Demo: Schema On Read Pt 3
12. Demo: Schema On Read Pt 4
13. Exercise 1: Schema On Read
14. Demo: Advanced Analytics NLP Pt 1
15. Demo: Advanced Analytics NLP Pt 2
16. Demo: Advanced Analytics NLP Pt 3
17. Exercise 2: Advanced Analytics NLP
18. Data Lake Implementation Introduction
19. Data Lake Concepts
20. Data Lake vs Data Warehouse
21. Data Lake Options on AWS
22. AWS Options: EMR (HDFS + Spark)
23. AWS Options: EMR: S3 + Spark
24. AWS Options: Athena
25. Demo: Data Lake on S3 Pt 1
26. Demo: Data Lake on S3 Pt 2
27. Exercise 3: Data Lake on S3
28. Demo: Data Lake on EMR Pt 1
29. Demo: Data Lake on EMR Pt 2
30. Demo: Data Lake on Athena Pt 1
31. Demo: Data Lake on Athena Pt 2
32. Data Lake Issues
33. [AWS] Launch EMR Cluster and Notebook
34. [AWS] Avoid Paying Unexpected Costs
Back to Home
01. Introduction
L04-00-Intro
Next Concept